Speaker recognition model using two-dimensional mel-cepstrum and predictive neural network
نویسندگان
چکیده
This paper describes a speaker recognition model using TwoDimensional Mel-Cepstrum and predictive neural network. The speaker model consists of two networks. The rst one is a self-organizing VQ map(Kohonen's feature map). The second part is the predictive network and learns transitional patterns on the feature map of each speaker's model. TDMC consists of averaged features and dynamic features of the two-dimensional mel-log spectra in the analyzed interval. The measure for speaker recognition is obtained by using a combination of the VQ distortion on the feature map and the prediction error on the predictive network. In the study, text-independent speaker identi cation experiments for 8 speakers were carried out. The experimental results have shown that a combination of a feature map and a predictive network is very e ective, and that the proposed model using TDMC shows the robustness for time interval.
منابع مشابه
Speaker Recognition in Clean and Noisy Environment Using Rbfnn and Aann
In this paper, we propose a speaker recognition system based on features extracted from the speech recorded using close speaking microphone in clean and noisy environment. This system recognizes the speakers from a number of acoustic features that include linear predictive coefficients (LPC), linear predictive cepstral coefficients (LPCC) and Mel-frequency cepstral coefficients (MFCC). RBFNN an...
متن کاملSignificance of formants from difference spectrum for speaker identification
In this paper, we describe a prototype speaker identification system using auto-associative neural network (AANN) and formant features. Our experiments demonstrate that formants extracted from difference spectrum perform significantly better than formants extracted from normal spectrum for the task of speaker identification. We also demonstrate that formants from difference spectrum provide com...
متن کاملSpeaker Recognition Using Gaussian Mixtures Models
Speech signal contains several levels of information. At first it contains information about the spoken message. At second level speech signal also gives information about the speaker identity, his emotional state and so on. The task of speaker recognition can be divided into two parts: speaker identification and speaker verification. Speaker identification is answering the question which one o...
متن کاملA Proportional Study on Feature Extraction Method in Automatic Speech Recognition System
Automatic speech recognition (ASR) has been the focus of many researchers for several years. In speech recognition system is for a computer be able to "hear,” understand," and "act upon" spoken information. The speaker recognition system viewed as working in a Analysis , Feature extraction , Modeling , Testing/Matching techniques .speech processing is to convey information about words, speaker ...
متن کاملThroat Microphone for Speaker Recognition Using AANN
In this paper, we have analyzed the performance of speaker recognition system based on features extracted from the speech recorded using throat microphone in clean and noisy environment. In general, clean speech performs better for speaker recognition system. Speaker recognition in noisy environment, using transducer held at the throat results in a signal that is clean even in noisy. This speak...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996